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TARGETED HETERO-ASSOCIATION OF RECOMBINANT 
PROTEINS TO MULTI-FUNCTIONAL COMPLEXES 


Background of the Invention 

Increasingly, there is a need for proteins which combine two or more functions, 
such as binding or catalysis, in a single structure. Typically, proteins which 
combine two or more functions are prepared either as fusion proteins or through 
chemical conjugation of the component functional domains. Both of these 
approaches suffer from disadvantages. Genetic "single chain" fusions suffer the 
disadvantages that (i) only a few (2-3) proteins can be fused (Rock et at., 1992, 
Prot. Eng. 5, 583-591), (ii) mutual interference between the component domains 
may hinder folding, and (iii) the size of the fusion protein may make it difficult to 
prepare. The alternative, chemical cross-linking in vitro following purification of 
independently expressed proteins, is difficult to control and invariably leads to 
undefined products and to a severe loss in yield of functional material. 

Recently, methods for achieving non-covalent association of two or more of the 
same functional domains have been developed. This can be achieved through 
the use of domains attached to peptides which self-associate to form homo- 
multimers (Pack & Pluckthun, 1992, Biochemistry 31, 1579-1584). For example, 
the association of two separately expressed scFv antibody fragments by C- 
terminally fused amphipathic helices in vivo provides homo-dimers of antibody 
fragments in E. coli (PCT/EP93/00082; Pack et al., 1993, Biotechnology 11, 
1271-1277) or homo-tetramers;(Pack et al., 1995, J. Mol. Biol., 246, 28-34). 

To assemble distinct protein functions such as two antibody fragments with 
different specificities fused to such association domains, the helices must have a 
tendency to form hetero-multimers. In principle, this could be achieved with 
complementary helices such as the hetero-dimerizing JUN and FOS zippers of 
the AP-1 transcription factor (O'Shea et al., 1992, Cell 68, 699-708). The clear 
disadvantage of association domains based on hetero-associating helices, 
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however, is their pseudo-symmetry and their similar periodicity of hydrophobic 
and hydrophilic residues. This structural similarity results in a strong tendency to 
form homo-dimers and, thus, to lower significantly the yield of hetero-dimers 
(O’Shea et al., 1992, Cell 68, 699-708; Pack, 1994, Ph. D. thesis, Ludwig- 
Maximilians-Universitat Munchen). Furthermore, the formation of JUN/FOS 
hetero-dimers is kinetically disfavoured and requires a temperature-dependent 
unfolding of the kinetically favoured homo-dimers, especially JUN/JUN homo¬ 
dimers (PCT/EP93/00082; O’Shea et al., 1992, Cell 68, 699-708; Pack, 1994, Ph. 
D. thesis, Ludwig-Maximilians-UniversitSt Munchen). Because of the need for 
additional purification steps to separate the unwanted homo-dimers from hetero¬ 
dimers and the resulting decrease in yield, hetero-association domains based on 
amphipathic helices do not result in practical advantages compared to 
conventional chemical coupling. 

These disadvantages of the prior art are overcome by the present invention 
which provides multi-functional polypeptides and methods for the preparation of 
these multi-functional proteins. This is achieved via the use of association 
domains which are designed to associate predominantly in a complementary 
fashion, and not to self-associate. 


Detailed Description of the Invention 

In the earliest steps of protein folding, peptide chains form a disordered 
hydrophobic core by collapsing hydrophobic residues into the interior of an 
intermediate "molten globule". This hydrophobic effect is considered to be the 
most important driving force of folding (Matthews, 1993, Annu. Rev. Biochem. 62, 
653 - 683; Fersht, 1993, FEBS Letters 325, 5 - 16). The burial of hydrophobic 
residues and the resulting exclusion of solvent is the determining factor in the 
stability of compact tertiary structures such as acyl-phosphatase (Pastore et al, J. 
Mol. Biol. 224, 427-440, 1992) interleukin-2 (Brandhuber t al., 1987, Science 
238, 1707 - 1709), calbindin (Parmentier, 1990, Adv. Exp. Med. Biol. 269, 27-34) 
or ubiquitin (Briggs & Roder, 1992, Proc. Natl. Acad. Sci. USA 89, 2017 - 2021). 
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This concept forms the basis of the present invention, which provides individually 
encoded peptides or “segments" which, in a single continuous chain, would 
comprise a compact tertiary structure with a highly hydrophobic core. The 
component peptides are chosen so as to be asymmetric in their assumed 
structure, so as not to self-associate to form homo-multimers, but rather to 
associate in a complementary fashion, adopting a stable complex which 
resembles the parent tertiary structure. On the genetic level, these segments are 
encoded by interchangeable cassettes with suitable restriction sites. These 
standardized cassettes are fused C- or N-terminally to different recombinant 
proteins via a linker or hinge in a suitable expression vector system. 

Thus, the present invention relates to a multi-functional polypeptide comprising. 

(a) a first amino acid sequence attached to at least one functional domain, 

(b) a second amino acid sequence attached to at least one further functional 
domain; and 

(c) optionally, further amino acid sequences each attached to at least one 
further functional domain; 

wherein any one or more of said amino acid sequences interacts with at least one 
of said amino acid sequences in a complementary fashion to form a parental, 
native-like tertiary or optionally quaternary structure and wherein the parental, 
native-like tertiary or optionally quaternary structure is derived from a single 
parent polypeptide. In this context, the term parent polypeptide refers to a 
polypeptide which has a compact tertiary or quarternary structure with a 
hydrophobic core. The invention provides for many different parent polypeptides 
to be used as the basis for the association domain. Suitable polypeptides can be 
identified by searching for compact, single-domain proteins or protein fragments 
in the database of known protein structures (Protein Data Bank, PDB) and 
selecting structures that are stable and can be expressed at high yields in 
recombinant form. These structures can then be analyzed for hydrophobic sub¬ 
clusters by the method of Karpeisky and llyn (1992, J. Mol. Biol. 224, 629-638) or 
for structural units (such as IJ-elements or helical hairpin structures) by standard 
molecular modelling techniques. In a further embodiment, the present invention 
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provides for multi-functional polypeptides wherein the single parent polypeptide is 
taken from the list ubiquitin, acyl phosphatase, IL-2, calbindin and myoglobin. 

In a preferred embodiment, the present invention provides a multi-functional 
polypeptide comprising two or more amino acid sequences each attached to at 
least one functional domain, wherein any two or more of said amino acid 
sequences can associate in a complementary fashion to provide a parental, 
native like, tertiary or optionally quaternary structure. 

Once structural sub-domains are identified, the protein is dissected in such a way 
these sub-domains remain intact. The selection process can be expanded to 
proteins for which no structure is available but which satisfy the criteria of stability 
and good expression. For these proteins, folding sub-domains can be determined 
by hydrogen exchange pulse-labelling of backbone amides during the folding 
reaction, followed by NMR detection in the native state (Roder et al., 1988, 
Nature 355, 700-704; Udgaonkar & Baldwin, 1988, Science 255, 594-597). 
Alternatively, folding sub-domains can be identified by mild proteolysis, 
denaturation, purification of fragments and reconstitution in vitro (Tasayco & 
Carey, 1992, Science 255, 594-597; Wu et al., 1993, Biochemistry 32, 10271- 
10276). Finally, additional clues for the choice of cleavage sites can be obtained 
from the exon structure in the case of eukaryotic proteins, since the exons 
frequently (though not always) correspond to structural sub-domains of a protein. 
This has, for example, been discussed for the case of myoglobin (Go 1981, 
Nature 291, 90). 

The yield of properly assembled molecules is expected to decrease significantly 
for constructs in which a protein domain is divided into three or more parts. This 
is due to the fact that several sub-domains must come together simultaneously to 
form a viable structure. This effect is countered by dividing the polypeptide chain 
into sub-domains that represent folding units (identified by the methods described 
above). Thus, not only the final, assembled complex but also assembly 
intermediates will have the stability necessary to allow their accumulation in the 
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host during expression, resulting in a greatly improved kinetic behaviour of the 
system. 

In solution, the isolated segments have little secondary structure and remain 
monomeric or form transient, non-specific and easily disrupted aggregates. Only 
upon mixing, either by separate expression and purification, or by co-expression, 
can the concerted folding of complementary segments provide the necessary 
intermediate interaction of residues (Matthews, 1993, Annu. Rev. Biochem. 62, 
653 - 683) that results in the formation of a compact, native-like structure. This 
association, mainly driven by the burial of hydrophobic residues of all segments 
into a single hydrophobic core, leads to a targeted assembly of the N- or C- 
terminally fused proteins to a multi-functional complex in vivo or in vitro. 

Optionally, the reconstituted native-like structure may also contribute an 
enzymatic or binding activity to increase the number of effector functions in the 
assembled complex. Accordingly, the present invention also provides a multi¬ 
functional polypeptide as described above, in which the native-like, tertiary or 
quaternary structure provides a biological activity. For example, when acyl 
phosphatase is used as the basis of the association domain, it is expected that 
the multi-functional polypeptide will retain some phosphatase activity. 

The present invention provides for many different types of functional domains to 
be linked into the multi-functional polypeptide. Particularly preferred are cases in 
which one or more, preferably two, of said functional domains are fragments 
derived from molecules of the immunoglobulin superfamily. In particularly 
preferred embodiments, said fragments are antibody fragments. Also preferred 
are cases in which at least one of the functional domains possesses biological 
activity other than that associated with a fragment derived from a member of the 
immunoglobulin superfamily. By way of example, the present invention provides 
for the targeted assembly of enzymes, toxins, cytokines, peptide hormones, 
immunoglobulins, metal binding domains, soluble receptors, lectins, lipoproteins, 
purification tails and bioactive peptides to multi-functional complexes (Fig. 1) 
based on a modular system of expression vectors, restriction sites and plug-in 
gene cassettes coding for assembly segments, peptide linkers and functional 
domains (Fig.2). 
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If covalent linkage between the segments is necessary to prevent dissociation at 
low concentrations, cysteines can be introduced to form inter-segmental 
disuiphide bridges between the amino acid sequences which comprise the 
association domain (Ecker et al., 1989.J. Biol. Chem. 264, 1887-1893; Pack & 
Pluckthun, 1992, Biochemistry 31 ,1579-1584). Accordingly, the present invention 
provides multi-functional polypeptides wherein the folding of the component 
amino acid sequences is stabilized by a covalent bond. 

In order to provide some flexibility between the association domain and the 
appended functional domains, it may be desired to incorporate a linker peptide. 
Accordingly, the present invention provides for multi-functional polypeptides of 
the type described above wherein at least one of the functional domains is 
coupled to said amino acid sequence via a flexible peptide linker. By way of 
example, the flexible linker may be derived from the hinge region of an antibody. 
The invention enables even more complex multi-functional polypeptides to be 
constructed via the attachment of at least one further (poly)peptide to one or 
more of said amino acid sequences. By way of example, the further (poly)peptide 
can be taken from the list enzymes, toxins, cytokines, peptide hormones, 
immunoglobulins, metal binding domains, soluble receptors, lectins, lipoproteins, 
purification tails, in particular peptides which are able to bind to an independent 
binding entity, bioactive peptides, preferably of 5 to 15 amino acid residues, 
metal binding proteins, DNA binding domains, transcription factors and growth 
factors. 

For therapeutic purposes, it is often desirable that proteinaceous substances 
display the minimum possible immunogenicity. Accordingly, the present invention 
provides for multi-functional polypeptides as described above in which at least 
one of said amino acid sequences, functional domains, or further (poly)peptides 
is of human origin. 

In addition to the peptides and proteins provided above, the present invention 
also provides for DNA sequences, vectors, preferably bicistronic vectors, vector 
cassettes, characterised in that they comprise a DNA sequ nee encoding an 
amino acid sequence and optionally at least one further (poly)peptide comprised 
in the multifunctional polypeptide of the invention, and additionally at least one, 
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preferably singular cloning sites for inserting the DNA encoding at least one 
further functional domain or that they comprise DNA sequences encoding the 
amino acid sequences, and optionally the further (poly)peptide(s) comprised in 
the multifunctional polypeptide of the invention and suitable restriction sites for 
the cloning of DNA sequences encoding the functional domains, such that upon 
expression of the DNA sequences after the insertion of the DNA sequences 
encoding the functional domains into said restriction sites, in a suitable host the 
multifunctional polypeptide of the invention is formed. In a preferred embodiment 
said vector cassette is characterised in that it comprises the inserted DNA 
sequence(s) encoding said functional domain(s) and host cells transformed with 
at least one vector or vector cassette of the invention which can be used for the 
preparation of said multi-functional polypeptides. 

In a further preferred embodiment, said host cell is a mammalian, preferably 
human, yeast, insect, plant or bacterial, preferably E. coli cell. 

The invention further provides for a method for the production of a multifunctional 
polypeptide of the invention, which comprises culturing the host cell of the 
invention in a suitable medium, and recovering said multifunctional polypeptide 
produced by said host cell. 

In a further embodiment, the invention relates to a method for the production of a 
multifunctional polypeptide of the invention which comprises culturing at least two 
host cells of the invention in a suitable medium, said host cells each producing 
only one of said first and said second amino acid sequences attached to at least 
one further functional domain, recovering the amino acid sequences, mixing 
thereof under mildly denaturing conditions and allowing in vitro folding of the 
multifunctional polypeptide of the invention from said amino acid sequences. 

In a particular preferred embodiment, said method is characterised in that the 
further amino acid sequences attached to at least one furth r functional domain 
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are/is produced by at least one further host cell not producing said first or second 
amino acid sequence. 

In another particularly preferred embodiment of the invention, said method is 
characterised in that at least one further amino acid sequence attached to at 
least one further functional domain is produced by the host cell of the invention 
producing said first or second amino acid sequence. 

In further preferred embodiments, the present invention provides for 
pharmaceutical and diagnostic compositions comprising the multi-functional 
polypeptides described above, said pharmaceutical compositions optionally 
comprising a pharmaceutically acceptable carrier. Finally, the invention provides 
for a kit comprising one or more vector cassettes useful in the preparation of said 
multi-functional polypeptides. 

The invention is now illustrated by reference to the following examples, which are 
provided for the purposes of illustration only and are not intended to limit the 
scope of the invention. 
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Example 1: Segmented human ubiquitln as an assembly device 

Ubiquitin is a compact intracellular protein of only 76 residues (Fig. 3) and a 
molecular weight of 5 kDa. It shows the highest conservation among all known 
proteins and is involved in the degradation pathway of intracellular eukaryotic 
proteins by forming intermediate isopeptide bonds to its C-terminus and to Lys48 
(Hershko & Ciechanover, 1992, Ann. Rev. Biochem. 61, 761-807). 

To use ubiquitin as an assembly device, the unwanted function can be abolished 
by truncation of the last three C-terminal residues (-Arg-Gly-Gly), and the 
exchange of Lys48 to Arg, which prevents the formation of isopeptide bonds to 
this residue. The altered sequence is then divided in a loop at position Gly36, so 
that the hydrophobic core falls apart into two segments (called ALPHA and 
BETA). The synthetic nucleotide sequence of the segments (Fig 4, 5) carry 
appropriate restriction sites (Mrol-Hindlll) at the termini, so that the cassette 
encoding the segments can be easily ligated to a EcoRI-Mrol cassette encoding 
the flexible linker (hinge of hulgG3; Fig. 6). The cassettes are inserted into the 
expression vector plG3 (EcoRI-Hindlll; Fig. 7) encoding the scFv fragment of the 
antibody McPC603 under the lac promoter/operator (Ge et al., 1995, in: Antibody 
engineering: A practical approach. IRL Press, New York, Borrebaeck ed., 229- 
261). Insertion of a second functional fragment (scFv fragment of the anti-IJ- 
lactam antibody 2H10 with phoA signal sequence) linked to association segment 
BETA as an Xbal-Hindlll DNA fragment (Fig. 8) results in a di-cistronic 
expression vector (Pack, 1994, Ph. D. thesis, Ludwig-Maximilians-UniversitSt 
Munchen). After induction with IPTG and translation, the signal sequences guide 
the antibody fragments fused to the assembly segments to the periplasm, where 
they assemble to a complex with a reconstituted native-like ubiquitin fold and two 
different antibody specificities. The complex, a bispecific immunoglobulin, can be 
recovered and purified by affinity chromatography of cell extract (Pack, 1994, Ph. 
D. thesis, Ludwig-Maximilians-Universitat Munchen). 
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Example 2: Covalent linkage of the native-like tertiary structure of 
the assembly device by engineered disulphide bridges and 
combination of a C-terminal peptide Sinker with an in-frame 
restriction site. 

The conformational stability of undivided, native ubiquitin can be enhanced by 
introduction of disulfides at positions 4 and 66 without perturbation in the 
backbone (Ecker et al„ 1989, J. Biol. Chem. 264, 1887-1893; Fig. 9). In the 
context of this invention, the engineering of disulfide bridges provides the 
covalent linkage of segments (Fig. 10,11) after co-folding and assembly. 

To raise the number of possible functional domains in the assembled complex, a 
C-terminal peptide can be fused to one or more of the segments of the assembly 
device. To fuse a functional domain like an enzyme, cytokine, antibody fragment, 
purification peptide or toxin to this linker, a restriction site, preferably unique, has 
to be introduced in-frame (Fig. 11). Gene synthesis, cloning, expression as well 
as recovery of the assembled, covalently linked complex is according to example 
1 . 


Example 3: Segmented human interleukin-2 (IL2) as an assembly 
device 

Human interleukin-2 (Brandhuber et al., 1987, Science 238, 1707 - 1709; Kuziel 
& Greene, 1991, in: The Cytokine Handbook. Academic Press, 84-100) is used 
as an assembly device by segmentation between position His79 and Lys 80 (Fig. 
12). The device, encoded by Mrol-Ascl-Hindll gene cassettes (Fig. 13, 14) 
combines the low immunogenicity of the plasmatic protein with a preferable 
effector function of the native-like cytokine structure and an inter-segmental 
cysteine bridge (Cys58-Cys105) after assembly. The combination of one or more 
antibody fragments against tumor antigens with additional cytokines like IL6 or 
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IL12 targets the multi-cytokine complex (Rock et a!., 1992, Prot. Eng. 5, 583-591) 
directly to the tumour. 


Example 4 : Segmented human apomyoglobin as an assembly 
device with three segments 

To use more than two segments of a native structure as an assembly device, the 
hydrophobic interface between the segments has to be large enough to provide 
the sufficient hydrophobic interaction for non-covalent linkage. Myoglobin 
(Fig. 15) is expressible in large amounts in £. coli (Guillemette et al., 1991, 
Protein Eng: 4, 585-592). Up to six functional domains can be assembled by a 
threefold segmented structure (Fig. 16, 17, 18), three at the N-termini and three 
at the C-termini of the segments. The presence of heme additionally stabilizes 
the native-like apomyoglobin fold and can be used as a switch to influence the 
association constant of the multi-functional complex. 


Example 5: Bioactive peptides as functional domains 

Certain peptides derived from amphipathic loop structures of LPS-binding 
proteins (Hoess et al., 1993, EMBO J. 12, 3351-3356) are able to neutralize 
endotoxin. This effect is enhanced by multivalent display of these short peptides 
(10-15 residues; Hoess, unpublished results). The present invention provides a 
method to express and assemble several of short peptides (Fig. 19), fused to an 
assembly segment, in a multivalent complex or in combination with other 
functional domains. The peptides can be fused either to the N-or to the C- 
terminus (Fig. 20, 21) of the assembly domain via the peptide linkers. 
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Example 6: A purification tail for IMAC as a functional domain 


Peptide tails consisting of histidines are able to coordinate metal ions. They are 
used for purification of native proteins in immobilized metal affinity 
chromatography (IMAC). Multivalent display of the purification tail considerably 
improves the maximum purity achievable by IMAC (Lindner et al., 1992, Methods: 
a companion to methods in enzymology 4, 41-56). One or more gene cassettes 
(Fig. 22) encoding a polyhistidine tail can be fused to the assembly segment to 
provide a simple and efficient purification method for multi-functional complexes. 


Example 7: The platelet aggregation inhibitor decorsin as a 
functional domain 

Decorsin, a 39 residue protein of the leech Macrobdella decora (Fig. 23), acts as 
a potent antagonist of the platelet glycoprotein llb-llla (Seymour et al., 1990, J. 
Biol. Chem. 265, 10143-10147). The gene cassette encoding the decorsin can be 
fused C- or N-terminally to an association segment (Fig. 24, 25). In arterial 
thrombotic deseases, a multivalent decorsin complex combined with an anti-fibrin 
antibody fragment can act as a powerful antithrombotic agent. 
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Claims 


1. A multifunctional polypeptide comprising: 

(a) a first amino acid sequence attached to at least one functional 
domain; 

(b) a second amino acid sequence attached to at least one further 
functional domain; and 

(c) optionally, further amino acid sequences each attached to at 
least one further functional domain; 

wherein any one or more of said amino acid sequences interacts with at 
least one of said amino acid sequences in a complementary fashion to 
form a parental, native-like tertiary or optionally quaternary structure and 
wherein said parental, native-like tertiary or optionally quaternary 
structure is derived from a single parent polypeptide. 

2. The multifunctional polypeptide according to claim 1, wherein said single 
parent polypeptide is ubiquitin, acyl-phosphatase, IL2, calbindin or 
apomyoglobin. 

3. The multifunctional polypeptide according to claim 1 or 2, wherein said 
parental, native-like tertiary or optionally quaternary structure is 
biologically active. 

4. The multifunctional polypeptide according to any one of claims 1 to 3, 
wherein at least one of said functional domains is a fragment derived 
from a member of the immunoglobulin superfamily. 

5. The multifunctional polypeptide according to claim 4, wherein two of said 
functional domains are fragments derived from members of the 
immunoglobulin superfamily. 
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6. The multifunctional polypeptide according to claim 4 or 5, wherein said 
fragments are antibody fragments. 

7. The multifunctional polypeptide according to any one of claims 1 to 6, 
wherein at least one of said functional domains is a biologically active 
molecule or a derivative thereof other than a fragment derived from a 
member of the immunoglobulin superfamily. 

8. The multifunctional polypeptide according to any one of claims 1 to 6, 
wherein the folding of the amino acid sequences is stabilised by a 
covalent bonding. 

9. The multifunctional polypeptide according to any one of claims 1 to 8, 
wherein at least one of said functional domains is coupled to said amino 
acid sequence(s) via a flexible peptide linker. 

10. The multifunctional polypeptide according to claim 9, wherein said 
flexible peptide linker is an antibody hinge region. 

11. The multifunctional polypeptide according to any one of calims 1 to 10, 
wherein at least one of said amino acid sequences is coupled to at least 
one further (poly)peptide. 

12. The multifunctional polypeptide according to claim 11, wherein said 
further (poly)peptide is an enzyme, a toxin, a cytokine, a metal binding 
site, a metal binding protein, a soluble receptor, a DNA-binding domain, 
a transcription factor, an immunoglobulin, a bioactive peptide of 5 to 15 
amino acid residues, a peptide hormone, a growth factor, a lectin, a 
lipoprotein, and a peptide which is able to bind to - an independent 
binding entity. 
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13. The multifunctional polypeptide according to any one of claims 1 to 12, 
wherein at least one of said amino acid sequences, functional domains 
or further (poly)peptide(s) is of human origin. 

14. A DNA sequence encoding an amino acid sequence and at least one 
functional domain and, optionally, at least one further functional 
(poly)peptide comprised in the multifunctional polypeptide of any one of 
claims 1 to 13. 

15 . A vector comprising at least one DNA molecule of claim 14. 

16. The vector of claim 15, which is a bicistronic vector. 

17. A vector cassette characterised in that it comprises a DNA sequence 
encoding an amino acid sequence and optionally at least one further 
(poly)peptide comprised in the multifunctional polypeptide of any one of 
claims 1 to 13, and additionally at least one, preferably a singular cloning 
site for inserting the DNA encoding at least one further functional 
domain. 

18. A vector cassette characterised in that it comprises DNA sequences 
encoding the amino acid sequences, and optionally the further 
(poly)peptide(s) comprised in the multifunctional polypeptide of any one 
of claims 1 to 13, and suitable restriction sites for the cloning of DNA 
sequences encoding the functional domains, such that upon expression 
of the DNA sequences after the insertion of the DNA sequences 
encoding the functional domains into said restriction sites, in a suitable 
host the multifunctional polypeptide according to any one of claims 1 to 


13 is formed. 
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19. The vector cassette according to claim 17 or 18 characterised in that it 
comprises the inserted DNA sequence(s) encoding said functional 
domain(s). 

20. A host cell transformed with at least one vector according to claim 15 or 
16, or at least one vector cassette according to claim 19. 

21. The host cell according to claim 20, which is a mammalian, perferably 
human, yeast, insect, plant or bacterial, preferably E. coli cell. 

22. A method for the production of a multifunctional polypeptide according to 
any one of claims 1 to 13, which comprises culturing the host cell 
according to claim 20 or 21 in a suitable medium, and recovering said 
multifunctional polypeptide produced by said host cell. 

23 . A method for the production of a multifunctional polypeptide according to 
any one of claims 1 to 13 which comprises culturing at least two host 
cells according to claim 20 or 21 in a suitable medium, said host cells 
each producing only one of said first and said second amino acid 
sequences attached to at least one further functional domain, recovering 
the amino acid sequences, mixing thereof under mildly- denaturing 
conditions and allowing in vitro folding of the multifunctional polypeptide 
according to any one of claims 1 to 13 from said amino acid sequences. 

24. The method according to claim 23, wherein the further amino acid 
sequence(s) (each) attached to at least one further functional domain 
are/is produced by at least one further host cell not producing said first 
or second amino acid sequence. 
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The method according to claim 23, wherein at least one further amino 
acid sequence attached to at least one further functional domain is 
produced by the host cell according to claim 20 or 21 producing said first 
or second amino acid sequence. 

A pharmaceutical composition comprising the multifunctional polypeptide 
according to any one of claims 1 to 13 optionally in combination with a 
pharmaceutically acceptable carrier. 

A diagnostic composition comprising the multifunctional polypeptide 
according to any one of claims 1 to 13. 

A kit comprising at least one vector cassette according to claim 17 or 18. 
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Pig.l: segmented tertiary structure for a targeted hetero-association 



compact tertiary structure 



non-native 


non-native 





enzyme 


coexpression or cofolding 



RECTIFIED SHEET (RULE 91) 

ISA/EP 
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Fia 2 : Modular cistron encoding functional domains 

N- and/or C-terminally fused to the assembly device 



RBCnREO SHEET (RULE 
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Fig. 3 : protein sequence of human ubiguitin (segmented 

after Gly35) 


i 10 20 30 *- 40 50 

MQIFVKTLTG KTITLEVEPS DTIENVKAKI QDKEGIPPDQ QRLIFAGKQL 

60 70 

edgrtlsdyn iqkestlhlv lrlrgg** 


Fig. 4: Mrol-Hind III gene cassette encoding for segment 
ALPHA of ubiquitin 


Mrol „ 

S G M Q I PVKTL 
TCC GGA ATG CAG ATC TTC GTT AAA ACC 
9 18 27 

AGG CCT TAC GTC TAG AAG CAA TTT TGG 


T G K T I T L 

CTG ACC GGT AAA ACC ATC ACC 
36 45 

GAC TGG CCA TTT TGG TAG TGG 


CTG 

GAC 


GAA 

54 

CTT 


vePSDTIENVKAKIQ 

GTT GAA CCG TCT GAC ACC ATC GAA AAC GTT AAA GCT AAA ATC CAG 

63 72 81 90 99 

CAA CTT GGC AGA CTG TGG TAG CTT TTG CAA TTT CGA TTT TAG GTC 


D 

GAC 

CTG 


K E 
AAA GAA 
108 
TTT CTT 


Hindlll 

G * w A 

GGT TGA TAA GCT T 3' 
117 

CCA ACT ATT CGA A 5' 
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Fig. 5: Mrol-Hind III gene cassette encoding for segment 
BETA of ubiquitin 


S GlPPDQQRLIFAGRQl*B 

TCC GGA ATC CCG CCG GAC CAG CAG CGT CTG ATC TTC GCT GGT CGT CAG CTG GAA 

9 18 27 36 45 54 

AGG CCT TAG GGC GGC CTG GTC GTC GCA GAC TAG AAG CGA CCA GCA GTC GAC CTT 


D G R T L 
GAC GGT CGT ACC CTG 
63 

CTG CCA GCA TGG GAC 


SDYNIQK.. ESTLHL 

TCT GAC TAC AAC ATC CAG AAA GAA TCT ACC CTG CAC CTG 

72 81 90 99 108 

AGA CTG ATG TTG TAG GTC TTT CTT AGA TGG GAC GTG GAC 


HindZZZ 


V L R L 
GTT CTG CGT CTG TGA TAA 3' 
117 126 

CAA GAC GCA GAC ACT ATT S’ 


Fig. 6: EcoRI-MroI gene cassette encoding a flexible linker 
(huIgG3) 


EcoRI M r 0 1 _ 

EFTPLGDTTHTSG 

5• GAA TTC ACC CCG CTG GGT GAC ACC ACC CAC ACC TCC GGA 3' 
9 18 27 36 

3 * CTT AAG TGG GGC GAC CCA CTG TGG TGG GTG TGG AGG CCT 5' 
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Construction of monocistronic expression vector 



Jfeal 


EcoRI Mrol Mrol 


Assembly 

segment 



HindiII 



RECTIFIED SHEET (RULE 91) 
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Fig. 8: Construction of dicistronic co-expression vectors 



segment ALPHA segment BETA 


insert second cistron (XbaI/HindIII-digested,blunt) 
Jntr> vprtnr fXhal-dieested. blunts 



RECTIFIED SHEET (RULE 91) 

ISA/EP 
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Fig. 9 : protein sequence of human ubiquitin with 
intersegmental disufides Cys4 and Cys66 (segmented after 
Gly35) 

! 10 20 30 V 40 50 

MQICVKTLTG KTITLEVEPS DTIENVKAKI QDKEGIPPDQ QRLIFAGKQL 


60 70 

EDGRTLSDYN IQKESCLHLV LRLRGG** 


Fig. 10: Mrol-Hind III gene cassette encoding for segment 
ALPHA-CYS4 of ubiquitin 


Mrol 


s 

G 

M 

Q 

I 

C 

V 

K 

T 

TCC 

GGA 

ATG 

CAG 

ATC 

TGC 

GTT 

AAA 

ACC 



9 



18 



27 

AGG 

CCT 

TAC 

GTC 

TAG 

ACG 

CAA 

TTT 

TGG 

V 

E 

P 

S 

D 

T 

I 

E 

N 

GTT 

GAA 

CCG 

TCT 

GAC 

ACC 

ATC 

GAA 

AAC 



63 



72 



81 

CAA 

CTT 

GGC 

AGA 

CTG 

TGG 

TAG 

CTT 

TTG 



HindllZ 





G 

* 

* 

A 






GGT 

TGA 

TAA CCT 

T 3 

« 






117 







CCA 

ACT 

ATT 

CGA 

A 5 

i 





36 


45 


54 


K 


90 


99 


108 
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Fig. 11 : Mrol-Ascl-Hind III gene cassette encoding for 
segment BETA-CYS6 6 with C-terminal GGSGGAP linker or 
ubiquitin 


sgippdqqri*i fAGR ® le 
TCC GGA ATC CCG CCG GAC CAG CAG CGT CTG ATC TTC GCT GGT CGT CAG CTG GAA 

9 18 27 36 45 54 

AGG CCT TAG GGC GGC CTG GTC GTC GCA GAC TAG AAG CCA CCA GCA GTC GAC CTT 


dgrtlsdyniqkesclhl 

GAC GGT CGT ACC CTG TCT GAC TAC AAC ATC CAG AAA GAA TCT TGC CTG CAC CTG 

63 72 81 90 99 108 

CTG CCA GCA TGG GAC AGA CTG ATG TTG TAG GTC TTT CTT AGA ACG GAC GTG GAC 


AecI Hindlll 

VLRLCOSCCAP * * 

GTT CTG CGT CTG CGO GGG ACC CCA GGC GCG CCG TGA TAA 3 
117 126 

CAA GAC GCA GAC CCC CCC TCC CCT CCG CGC GGC ACT ATT 5 


Fig. 12: Protein sequence of human IL-2 (segmented after 
His79) 

10 20 30 40 50 60 

APTSSSTKKT QLQLEHLLLD LQMILNGINN YKNPKLTRML TFKFYMPKKA TELKHLQCLE 

70 S«r- 90 ,100 110 120 

EELKPLEEVL NLAQSKNFHL RPRDLISNIN VIVLELKGSE TTFMCEYADE TATIVEFLNR 

130 

WITFCQSIIS TLT 
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Fig.13 j MroI-Ascl-Hind III gene cassette encoding for 
segment ALPHA of human IL-2 


SGAPTSSSTKKTQL 

TCC GGA CCA CCT ACT TCA ACT TCT ACA AAG AAA ACA CAG CTA 
9 18 27 36 

AGG CCT CGT GGA TGA AGT TCA AGA TCT TTC TTT TGT CTC GAT 


Q 

CAA 

45 

GTT 


L E H 
CTG GAG CAT 
54 

GAC CTC GTA 


llldlqmil 
TTA CTG CTG GAT TTA CAG ATG ATT TTG 
63 72 81 

AAT GAC GAC CTA AAT GTC TAC TAA AAC 


K L T R H L T F K 

AAA CTC ACC AGG ATG CTC ACA TTT AAG 
117 126 135 

TTT GAG TGG TCC TAC GAG TGT AAA TTC 


LKHLQCLEE 
CTG AAA CAT CTT CAG TGT CTA GAA GAA 
171 180 189 

GAC TTT GTA GAA GTC ACA GAT CTT CTT 


nlaqsknfh 
AAT TTA GCT CAA AGC AAA AAC TTT CAC 
225 234 243 

TTA AAT CGA GTT TCG TTT TTG AAA CTG 


NG INNYKNP 
AAT GGA ATT AAT AAT TAC AAG AAT CCC 
90 99 108 

TTA CCT TAA TTA TTA ATG TTC TTA GGC 


FYMP K.KATE 

TTT TAC ATG CCC AAG AAG GCC ACA GAA 
144 153 162 

AAA ATG TAC GGG TTC TTC CGG TGT CTT 


elkpleevl 
GAA CTC AAA CCT CTG GAG GAA GTC CTA 
198 207 216 

CTT GAG TTT GGA GAC CTC CTT CAC GAT 


AeeZ Hindi!I 

ggsggap * 
GGG GGG AGC GGA GGC CCG CCG TGA T 
252 261 

CCC CCC TCG CCT CCG CGC GGC ACT A 
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Fig. 14 : Mrol-Ascl-Hind III gene cassette encoding for 
segment BETA of human IL>-2 


gGLRPRDLIS HlNVI 

TCC GGA TTA AGA CCC AGG GAC TTA ATC AGC AAT ATC AAC CTA ATA 

9 18 27 36 45 

AGG CCT AAT TCT GGG TCC CTG AAT TAG TCG TTA TAG TTG CAT TAT 


V L E 
GTT CTG GAA 
54 

CAA GAC CTT 


lkcsettfmceyade 
CTA AAG GGA TCT GAA ACA ACA TTC ATG TGT GAA TAT GCT GAT GAG 

63 72 81 90 99 

GAT TTC CCT AGA CTT TGT TGT AAG TAC ACA CTT ATA CGA CTA CTC 


TAT 
ACA GCA ACC 
108 

TGT CGT TGG 


IVEFLNRWITFCQSI 

ATT CTA GAA TTT CTG AAC AGA TGG ATT ACC TIT TGT CAA AGC ATC 

117 126 135 144 153 

TAA CAT CTT AAA GAC TTG TCT ACC TAA TGG AAA ACA GTT TCG TAG 


1ST 
ATC TCA ACA 
162 


TAG ACT TGT 


L T G 


ABCl 

G S G G A 


P 


HindiII 
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Pig. 15 Protein sequence of human apomyoglobin (cut after 
Lys47 and Lys98) 

Jr- 

mglsdgewql vlnvwgkvea dipghgqevl irlfkghpet lekfdkfkhl 

51 • **“. 

ksedemkase dlkkhgatvl talggilkkk ghheaeikpl aqshatkhkx 

pvkylefise ciiqvlqskh pgdfgadaeg amnkalelfr kdmasnykel 

151 

gfqg 


Fig. 16 : Mrol-Ascl-Hind III gene cassette encoding for 
segment ALPHA of human apomyoglobin 


S D G E W Q L 
TCT GAC GGT GAA TGG CAG CTG 
18 27 36 

AGA CTG CCA CTT ACC GTC GAC 


KVEADI PGH GQE 
AAA GTT GAA GCT GAC ATC CCG GGT CAC GGT CAG GAA 
63 72 81 90 

ITT CAA CTT CGA CTG TAG GGC CCA GTG CCA GTC CTT 


KGHPETLEKF DK 

AAA GGT CAC CCG GAA ACC CTG GAA AAA TTC GAC AAA 

117 126 135 144 

TTT CCA GTG GGC CTT TGG GAC CTT TTT AAG CTG TTT 


AscI HindllX 

GAP* 

GGC GCG CCG TGA T 3' 
171 

CCG CGC GGC ACT A 5' 


V L N V W G 
GTT CTG AAC GTT TGG GGT 
45 54 

CAA GAC TTG CAA ACC CCA 


V L I R L F 

GTT CTG ATC CGT CTG TTC 
99 108 
CAA GAC TAG GCA GAC AAG 


? K G G S G 

rTC AAA GGG GGG AGC GGA 
153 162 

AAG TTT CCC CCC TCG CCT 


HroX 

S G M G L 
TCC GGA ATG GGT CTG 
9 

AGG CCT TAC CCA GAC 
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Fig. 17: Mrol-Ascl-Hind III gene cassette encoding for 
segment BETA of human apomyogl'obin 


sghlksede mk ase dlkk 

TCC GGA CAC CTG AAA TCT GAA GAC GAA ATG AAA GCA TCT GAA GAC CTG AAA AAA 
AGG CCT GTG GAC TTT AGA CTT CTG CTT TAC TTT CCT AGA CTT CTG GAC TTT TTT 


hgatvltalggil 

CAC GGT GCT ACC GTT CTG ACC GCT CTG GGT GGT ATC CTG 
63 72 81 90 

GTG CCA CGA TGG CAA GAC TGG CGA GAC CCA CCA TAG GAC 


K 

AAA 

TTT 


K K G H 
AAA AAA GGT CAC 
99 108 
TTT TTT CCA GTG 


H E A E 
CAC GAA GCT GAA 
117 

GTG CTT CGA CTT 


1 k P L A Q S H A T K H K G 

ATC AAA CCG CTG GCT CAG TCT CAC GCT ACC AAA CAC AAA GGG 

126 135 144 153 162 

TAG TTT GGC GAC CGA GTC AGA GTG CGA TGG TTT GTG TTT CCC 


AscI HindXXX 

G S G G A P * 

GGG AGC GGA GGC GCG CCG TGA T 3* 

171 180 

CCC TCG CCT CCG CCC GGC ACT A 5* 
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Fig. 18: Mrol-Ascl-Hind III gene cassette encoding for 
segment GAMMA of human apomyoglobin 


g j pyj^ylefiseci 
TCC GGA ATC CCC GTT AAA TAG CTG GAG TTC ATC TCT GAA TGC ATC 

“9 18 27 36 45 

AGG CCT TAG GGC CAA TTT ATG GAC CTC AAG TAG AGA CTT ACG TAG 


I 

ATC 

TAG 


Q V 
CAG GTT 
54 

GTC CAA 


r oSKHPGDFGADAEGAMN 

CTG CAG TCT AAA CAC CCG GGT GAC TTC GGT GCT GAC GCT GAA GGT GCT ATG AAC 

63 72 81 90 99 108 

GAC GTC AGA TTT GTG GGC CCA CTG AAG CCA CGA CTG CGA CTT CCA CGA TAC TTG 


K E L G 
AAA GAA CTG GGT 
153 162 

TTT CTT GAC CCA 
F F Q T 


AscX HindXXX 

fqgggsggap* 

TTC CAG GGT GGG GGG AGC GGA GGC GCG CCG TGA T 3• 
171 180 189 

AAG GTC CCA CCC CCC TCG CCT CCG CGC GGC ACT A 5* 


k^lelfrkdmasny 
AAA GCT CTG GAA CTG TTC CGT AAA GAC ATG GCT TCT AAC TAC 

117 126 135 144 

TTT CGA GAC CTT GAC AAG GCA TTT CTG TAC CGA AGA TTG ATG 

pSQFQETFVHSRVV 
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Fig. 19: Peptide sequence 

peptide as a functional domain 


of an endotoxin-neutralizing 


1 11 
RWKVRKSFFKL Q 


Fig. 20: N-terminal EcoRV-EcoRl cassette encoding an 

endotoxin-neutralizing peptide 


EcoRX 

Ec°HV r w R v R K s F F K L Q E F 

5 • ATC ATG CGT TGG AAA GTT CGT AAA TCT TTC TTC AAA CTG CAG GAA TTC 

3 • TAG TAC GCA ACC TTT CAA GCA ITT AGA AAG AAG TTT GAC GTC CTT AAG 5 


3 ‘ 


Fig. 21: C-terminal Ascl-Hindlll cassette 

endotoxin-neutralizing peptide 


encoding 


an 


5 1 


3 • 


Aucl _ 

APRWKVRKSFFK 

GCG CCG CGT TGG AAA GTT CGT AAA TCT TTC TTC AAA 
9 18 27 36 

CGC GGC GCA ACC TTT CAA GCA TTT AGA AAG AAG TTT 


HindiII 

L Q 

CTG CAG TGA TAA 3* 
45 

GAC GTC ACT ATT 5' 


Fig. 22 Ascl-HINDIII Gene cassette encoding a purification 
tail for IMAC 


AscI 111 

aphhhhhh* • 

5 * GCG CCG CAC CAC CAC CAC CAC CAC TGA TAA 3 * 
9 18 27 

3 1 CGC GGC GTG GTG GTG GTG GTG CAC ACT ATT 5 * 
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rig. 23 Protein sequence of the platelet aggregation 
inhibitor decorsin as a functional domain 


1 11 21 31 

APRLPQCQGD DQEKCLCNKD ECPPGQCRFP RGDADPYCE 


Fig. 24 N-terminal EcoRV-EcoRI cassette encoding the 
platelet aggregation inhibitor decorsin 


rlpqcqgddqekcl 
CGT CTG CCG CAG TGC CAG GGT GAC GAC CAG GAA AAA TGC CTG 

18 27 36 45 _ 54 

GCA GAC GGC GTC ACG GTC CCA CTG CTG GTC CTT TTT ACG GAC 


CNKDECPPCQCR FPRGDA 

TGC AAC AAA GAC GAA TGC CCG CCG GGT CAG TGC CGT TTC CCG CGT GGT GAC GCT 

63 72 81 90 99 108 

ACG TTG TTT CTG CTT ACG GGC GGC CCA GTC ACG GCA AAG GGC GCA CCA CTG CGA 


ZcoRX 

D P Y C E F 
GAC CCG TAC TGC GAA TTC 3‘ 
117 126 

CTG GGC ATG ACG CTT AAG S' 


EcoRV 

D I A P 
GAT ATC GCT CCG 
9 

CTA TAG CGA GGC 


Fig. 25 C-terminal Ascl-Hindlll cassette encoding the 
platelet aggregation inhibitor decorsin 


apaprlpqcqgdd qekcl 
CCG CCG GCT CCG CGT CTG CCG CAG TGC CAG GGT GAC GAC CAG GAA AAA TGC CTG 

12 21 30 39 48 57 

CGC GGC CGA GGC GCA GAC GGC GTC ACG GTC CCA CTG CTG GTC CTT TTT ACG GAC 

CNKDECPPCQCRFPRGDA 

TGC AAC AAA GAC GAA TGC CCG CCG GGT CAG TGC CGT TTC CCG CGT GGT GAC GCT 

66 75 84 93 102 111 

ACG TTG TTT CTG CTT ACG GGC GGC CCA GTC ACG GCA AAG GGC GCA CCA CTG CGA 

HindXXX 

D P Y C E * 

GAC CCG TAC TGC GAA TGA TAA 3• 

120 129 

CTG GGC ATG ACG CTT ACT ATT 5' 
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